Prediction of proteinase cleavage sites in polyproteins of coronaviruses and its applications in analyzing SARS-CoV genomes.
Identifieur interne : 005F05 ( Main/Exploration ); précédent : 005F04; suivant : 005F06Prediction of proteinase cleavage sites in polyproteins of coronaviruses and its applications in analyzing SARS-CoV genomes.
Auteurs : Feng Gao [République populaire de Chine] ; Hong-Yu Ou ; Ling-Ling Chen ; Wen-Xin Zheng ; Chun-Ting ZhangSource :
- FEBS letters [ 0014-5793 ] ; 2003.
Descripteurs français
- KwdFr :
- Alignement de séquences, Animaux, Bases de données génétiques, Bovins, Coronavirus (), Coronavirus (enzymologie), Coronavirus (génétique), Données de séquences moléculaires, Endopeptidases (), Endopeptidases (métabolisme), Génome viral, Humains, Logiciel, Oiseaux, Polyprotéines (), Polyprotéines (génétique), Polyprotéines (métabolisme), Protéines virales (génétique), Protéines virales (métabolisme), Sites de fixation, Souris, Suidae, Séquence d'acides aminés, Virus du SRAS (génétique).
- MESH :
- enzymologie : Coronavirus.
- génétique : Coronavirus, Polyprotéines, Protéines virales, Virus du SRAS.
- métabolisme : Endopeptidases, Polyprotéines, Protéines virales.
- Alignement de séquences, Animaux, Bases de données génétiques, Bovins, Coronavirus, Données de séquences moléculaires, Endopeptidases, Génome viral, Humains, Logiciel, Oiseaux, Polyprotéines, Sites de fixation, Souris, Suidae, Séquence d'acides aminés.
English descriptors
- KwdEn :
- Amino Acid Sequence, Animals, Binding Sites, Birds, Cattle, Coronavirus (chemistry), Coronavirus (enzymology), Coronavirus (genetics), Databases, Genetic, Endopeptidases (chemistry), Endopeptidases (metabolism), Genome, Viral, Humans, Mice, Molecular Sequence Data, Polyproteins (chemistry), Polyproteins (genetics), Polyproteins (metabolism), SARS Virus (genetics), Sequence Alignment, Software, Swine, Viral Proteins (genetics), Viral Proteins (metabolism).
- MESH :
- chemical , chemistry : Endopeptidases, Polyproteins.
- chemistry : Coronavirus.
- enzymology : Coronavirus.
- genetics : Coronavirus, Polyproteins, SARS Virus, Viral Proteins.
- chemical , metabolism : Endopeptidases, Polyproteins, Viral Proteins.
- Amino Acid Sequence, Animals, Binding Sites, Birds, Cattle, Databases, Genetic, Genome, Viral, Humans, Mice, Molecular Sequence Data, Sequence Alignment, Software, Swine.
Abstract
Recently, we have developed a coronavirus-specific gene-finding system, ZCURVE_CoV 1.0. In this paper, the system is further improved by taking the prediction of cleavage sites of viral proteinases in polyproteins into account. The cleavage sites of the 3C-like proteinase and papain-like proteinase are highly conserved. Based on the method of traditional positional weight matrix trained by the peptides around cleavage sites, the present method also sufficiently considers the length conservation of non-structural proteins cleaved by the 3C-like proteinase and papain-like proteinase to reduce the false positive prediction rate. The improved system, ZCURVE_CoV 2.0, has been run for each of the 24 completely sequenced coronavirus genomes in GenBank. Consequently, all the non-structural proteins in the 24 genomes are accurately predicted. Compared with known annotations, the performance of the present method is satisfactory. The software ZCURVE_CoV 2.0 is freely available at http://tubic.tju.edu.cn/sars/.
DOI: 10.1016/s0014-5793(03)01091-3
PubMed: 14572668
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream PubMed, to step Corpus: 003130
- to stream PubMed, to step Curation: 003130
- to stream PubMed, to step Checkpoint: 003156
- to stream Ncbi, to step Merge: 000403
- to stream Ncbi, to step Curation: 000403
- to stream Ncbi, to step Checkpoint: 000403
- to stream Main, to step Merge: 006391
- to stream Main, to step Curation: 005F05
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Prediction of proteinase cleavage sites in polyproteins of coronaviruses and its applications in analyzing SARS-CoV genomes.</title>
<author><name sortKey="Gao, Feng" sort="Gao, Feng" uniqKey="Gao F" first="Feng" last="Gao">Feng Gao</name>
<affiliation wicri:level="3"><nlm:affiliation>Department of Physics, Tianjin University, 300072, Tianjin, PR China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Department of Physics, Tianjin University, 300072, Tianjin</wicri:regionArea>
<placeName><settlement type="city">Tianjin</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Ou, Hong Yu" sort="Ou, Hong Yu" uniqKey="Ou H" first="Hong-Yu" last="Ou">Hong-Yu Ou</name>
</author>
<author><name sortKey="Chen, Ling Ling" sort="Chen, Ling Ling" uniqKey="Chen L" first="Ling-Ling" last="Chen">Ling-Ling Chen</name>
</author>
<author><name sortKey="Zheng, Wen Xin" sort="Zheng, Wen Xin" uniqKey="Zheng W" first="Wen-Xin" last="Zheng">Wen-Xin Zheng</name>
</author>
<author><name sortKey="Zhang, Chun Ting" sort="Zhang, Chun Ting" uniqKey="Zhang C" first="Chun-Ting" last="Zhang">Chun-Ting Zhang</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2003">2003</date>
<idno type="RBID">pubmed:14572668</idno>
<idno type="pmid">14572668</idno>
<idno type="doi">10.1016/s0014-5793(03)01091-3</idno>
<idno type="wicri:Area/PubMed/Corpus">003130</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">003130</idno>
<idno type="wicri:Area/PubMed/Curation">003130</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">003130</idno>
<idno type="wicri:Area/PubMed/Checkpoint">003156</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">003156</idno>
<idno type="wicri:Area/Ncbi/Merge">000403</idno>
<idno type="wicri:Area/Ncbi/Curation">000403</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000403</idno>
<idno type="wicri:doubleKey">0014-5793:2003:Gao F:prediction:of:proteinase</idno>
<idno type="wicri:Area/Main/Merge">006391</idno>
<idno type="wicri:Area/Main/Curation">005F05</idno>
<idno type="wicri:Area/Main/Exploration">005F05</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Prediction of proteinase cleavage sites in polyproteins of coronaviruses and its applications in analyzing SARS-CoV genomes.</title>
<author><name sortKey="Gao, Feng" sort="Gao, Feng" uniqKey="Gao F" first="Feng" last="Gao">Feng Gao</name>
<affiliation wicri:level="3"><nlm:affiliation>Department of Physics, Tianjin University, 300072, Tianjin, PR China.</nlm:affiliation>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Department of Physics, Tianjin University, 300072, Tianjin</wicri:regionArea>
<placeName><settlement type="city">Tianjin</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Ou, Hong Yu" sort="Ou, Hong Yu" uniqKey="Ou H" first="Hong-Yu" last="Ou">Hong-Yu Ou</name>
</author>
<author><name sortKey="Chen, Ling Ling" sort="Chen, Ling Ling" uniqKey="Chen L" first="Ling-Ling" last="Chen">Ling-Ling Chen</name>
</author>
<author><name sortKey="Zheng, Wen Xin" sort="Zheng, Wen Xin" uniqKey="Zheng W" first="Wen-Xin" last="Zheng">Wen-Xin Zheng</name>
</author>
<author><name sortKey="Zhang, Chun Ting" sort="Zhang, Chun Ting" uniqKey="Zhang C" first="Chun-Ting" last="Zhang">Chun-Ting Zhang</name>
</author>
</analytic>
<series><title level="j">FEBS letters</title>
<idno type="ISSN">0014-5793</idno>
<imprint><date when="2003" type="published">2003</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Amino Acid Sequence</term>
<term>Animals</term>
<term>Binding Sites</term>
<term>Birds</term>
<term>Cattle</term>
<term>Coronavirus (chemistry)</term>
<term>Coronavirus (enzymology)</term>
<term>Coronavirus (genetics)</term>
<term>Databases, Genetic</term>
<term>Endopeptidases (chemistry)</term>
<term>Endopeptidases (metabolism)</term>
<term>Genome, Viral</term>
<term>Humans</term>
<term>Mice</term>
<term>Molecular Sequence Data</term>
<term>Polyproteins (chemistry)</term>
<term>Polyproteins (genetics)</term>
<term>Polyproteins (metabolism)</term>
<term>SARS Virus (genetics)</term>
<term>Sequence Alignment</term>
<term>Software</term>
<term>Swine</term>
<term>Viral Proteins (genetics)</term>
<term>Viral Proteins (metabolism)</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr"><term>Alignement de séquences</term>
<term>Animaux</term>
<term>Bases de données génétiques</term>
<term>Bovins</term>
<term>Coronavirus ()</term>
<term>Coronavirus (enzymologie)</term>
<term>Coronavirus (génétique)</term>
<term>Données de séquences moléculaires</term>
<term>Endopeptidases ()</term>
<term>Endopeptidases (métabolisme)</term>
<term>Génome viral</term>
<term>Humains</term>
<term>Logiciel</term>
<term>Oiseaux</term>
<term>Polyprotéines ()</term>
<term>Polyprotéines (génétique)</term>
<term>Polyprotéines (métabolisme)</term>
<term>Protéines virales (génétique)</term>
<term>Protéines virales (métabolisme)</term>
<term>Sites de fixation</term>
<term>Souris</term>
<term>Suidae</term>
<term>Séquence d'acides aminés</term>
<term>Virus du SRAS (génétique)</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="chemistry" xml:lang="en"><term>Endopeptidases</term>
<term>Polyproteins</term>
</keywords>
<keywords scheme="MESH" qualifier="chemistry" xml:lang="en"><term>Coronavirus</term>
</keywords>
<keywords scheme="MESH" qualifier="enzymologie" xml:lang="fr"><term>Coronavirus</term>
</keywords>
<keywords scheme="MESH" qualifier="enzymology" xml:lang="en"><term>Coronavirus</term>
</keywords>
<keywords scheme="MESH" qualifier="genetics" xml:lang="en"><term>Coronavirus</term>
<term>Polyproteins</term>
<term>SARS Virus</term>
<term>Viral Proteins</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr"><term>Coronavirus</term>
<term>Polyprotéines</term>
<term>Protéines virales</term>
<term>Virus du SRAS</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="metabolism" xml:lang="en"><term>Endopeptidases</term>
<term>Polyproteins</term>
<term>Viral Proteins</term>
</keywords>
<keywords scheme="MESH" qualifier="métabolisme" xml:lang="fr"><term>Endopeptidases</term>
<term>Polyprotéines</term>
<term>Protéines virales</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Amino Acid Sequence</term>
<term>Animals</term>
<term>Binding Sites</term>
<term>Birds</term>
<term>Cattle</term>
<term>Databases, Genetic</term>
<term>Genome, Viral</term>
<term>Humans</term>
<term>Mice</term>
<term>Molecular Sequence Data</term>
<term>Sequence Alignment</term>
<term>Software</term>
<term>Swine</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr"><term>Alignement de séquences</term>
<term>Animaux</term>
<term>Bases de données génétiques</term>
<term>Bovins</term>
<term>Coronavirus</term>
<term>Données de séquences moléculaires</term>
<term>Endopeptidases</term>
<term>Génome viral</term>
<term>Humains</term>
<term>Logiciel</term>
<term>Oiseaux</term>
<term>Polyprotéines</term>
<term>Sites de fixation</term>
<term>Souris</term>
<term>Suidae</term>
<term>Séquence d'acides aminés</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Recently, we have developed a coronavirus-specific gene-finding system, ZCURVE_CoV 1.0. In this paper, the system is further improved by taking the prediction of cleavage sites of viral proteinases in polyproteins into account. The cleavage sites of the 3C-like proteinase and papain-like proteinase are highly conserved. Based on the method of traditional positional weight matrix trained by the peptides around cleavage sites, the present method also sufficiently considers the length conservation of non-structural proteins cleaved by the 3C-like proteinase and papain-like proteinase to reduce the false positive prediction rate. The improved system, ZCURVE_CoV 2.0, has been run for each of the 24 completely sequenced coronavirus genomes in GenBank. Consequently, all the non-structural proteins in the 24 genomes are accurately predicted. Compared with known annotations, the performance of the present method is satisfactory. The software ZCURVE_CoV 2.0 is freely available at http://tubic.tju.edu.cn/sars/.</div>
</front>
</TEI>
<affiliations><list><country><li>République populaire de Chine</li>
</country>
<settlement><li>Tianjin</li>
</settlement>
</list>
<tree><noCountry><name sortKey="Chen, Ling Ling" sort="Chen, Ling Ling" uniqKey="Chen L" first="Ling-Ling" last="Chen">Ling-Ling Chen</name>
<name sortKey="Ou, Hong Yu" sort="Ou, Hong Yu" uniqKey="Ou H" first="Hong-Yu" last="Ou">Hong-Yu Ou</name>
<name sortKey="Zhang, Chun Ting" sort="Zhang, Chun Ting" uniqKey="Zhang C" first="Chun-Ting" last="Zhang">Chun-Ting Zhang</name>
<name sortKey="Zheng, Wen Xin" sort="Zheng, Wen Xin" uniqKey="Zheng W" first="Wen-Xin" last="Zheng">Wen-Xin Zheng</name>
</noCountry>
<country name="République populaire de Chine"><noRegion><name sortKey="Gao, Feng" sort="Gao, Feng" uniqKey="Gao F" first="Feng" last="Gao">Feng Gao</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/SrasV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 005F05 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 005F05 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Sante |area= SrasV1 |flux= Main |étape= Exploration |type= RBID |clé= pubmed:14572668 |texte= Prediction of proteinase cleavage sites in polyproteins of coronaviruses and its applications in analyzing SARS-CoV genomes. }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i -Sk "pubmed:14572668" \ | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd \ | NlmPubMed2Wicri -a SrasV1
This area was generated with Dilib version V0.6.33. |